A Survey of Feature Selection Strategies for DNA Microarray Classification
نویسندگان
چکیده
Classification tasks are difficult and challenging in the bioinformatics field, that used to predict or diagnose patients at an early stage of disease by utilizing DNA microarray technology. However, crucial characteristics technology a large number features small sample sizes, which means confronts "dimensional curse" its classification because high computational execution needed discovery biomarkers difficult. To reduce dimensionality find significant can employ feature selection algorithms not affect performance tasks. Feature helps decrease time removing irrelevant redundant from data. The study aims briefly survey popular methods for classifying technology, such as filters, wrappers, embedded, hybrid approaches. Furthermore, this describes steps process accomplish their relationships other components datasets, cross-validation, classifier algorithms. In case study, we chose four different on two-DNA datasets evaluate discuss performances, namely accuracy, stability, subset size selected features. Keywords: Brief survey; data; selection; filter methods; wrapper embedded methods. DOI: 10.7176/CEIS/14-2-01 Publication date: March 31 st 2023
منابع مشابه
A Classification Method for E-mail Spam Using a Hybrid Approach for Feature Selection Optimization
Spam is an unwanted email that is harmful to communications around the world. Spam leads to a growing problem in a personal email, so it would be essential to detect it. Machine learning is very useful to solve this problem as it shows good results in order to learn all the requisite patterns for classification due to its adaptive existence. Nonetheless, in spam detection, there are a large num...
متن کاملFeature Selection and Classification of Microarray Gene Expression Data of Ovarian Carcinoma Patients using Weighted Voting Support Vector Machine
We can reach by DNA microarray gene expression to such wealth of information with thousands of variables (genes). Analysis of this information can show genetic reasons of disease and tumor differences. In this study we try to reduce high-dimensional data by statistical method to select valuable genes with high impact as biomarkers and then classify ovarian tumor based on gene expression data of...
متن کاملH-BwoaSvm: A Hybrid Model for Classification and Feature Selection of Mammography Screening Behavior Data
Breast cancer is one of the most common cancer in the world. Early detection of cancers cause significantly reduce in morbidity rate and treatment costs. Mammography is a known effective diagnosis method of breast cancer. A way for mammography screening behavior identification is women's awareness evaluation for participating in mammography screening programs. Todays, intelligence systems could...
متن کاملFeature Selection by Weighted-SNR for Cancer Microarray Data Classification
Feature selection technique is widely used to improve the high dimensional data analysis especially in a classification task. Cancer microarray data classification task belongs to this category. There are many researches that study the feature selection of microarray data classification. The major problem is that many feature selection methods must pre-define the number of feature. Unfortunatel...
متن کاملFeature selection in independent component subspace for microarray data classification
A novel method for microarray data classification is proposed in this letter. In this scheme, the sequential floating forward selection (SFFS) technique is used to select the independent components of the DNA microarray data for classification. Experimental results show that the method is efficient and feasible. r 2006 Elsevier B.V. All rights reserved.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computer Engineering and Intelligent Systems
سال: 2023
ISSN: ['2222-1727', '2222-2871']
DOI: https://doi.org/10.7176/ceis/14-2-01